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Abstract —This paper investigates a channel-coded multiuser 
system operated with orthogonal frequency-division multiplexing 
(OFDM) and interleaved division multiple-access (IDMA). In 
general, there are many variations to multiuser systems. Our 
choice of the combination of OFDM and IDMA is motivated by its 
ability to achieve multiuser diversity gain in frequency-selective 
multiple-access channels. However, to realize this potential ad¬ 
vantage of OFDM-IDMA, two challenges must be addressed. The 
first challenge is the estimation of multiple channel parameters. 
An issue is how to contain the estimation errors of the channel 
parameters of the multiple users, considering that the overall 
estimation errors may increase with the number of users because 
the estimations of their channel parameters are intertwined with 
each other. The second challenge is that the transmitters of the 
multiple users may be driven by different RF oscillators. The 
associated frequency asynchrony may cause multiple CFOs at 
the receiver. Compared with a single-user receiver where the 
single CFO can be compensated away, a particular difficulty 
for a multiuser receiver is that it is not possible to compensate 
for all the multiple CFOs simultaneously. To tackle the two 
challenges, we put forth a framework to solve the problems 
of multiuser channel-parameter estimation, CFO compensation, 
and channel decoding jointly and iteratively. The framework em¬ 
ploys the space alternating generalized expectation-maximization 
(SAGE) algortihm to decompose the multisuser problem into 
multiple single-user problems, and the expectation-conditional 
maximization (ECM) algorithm to tackle each of the single-user 
subproblems. Iterative executions of SAGE and ECM in the 
framework allow the two aforementioned challenges to be tackled 
in an optimal manner. Simulations and real experiments based on 
software-defined radio (SDR) indicate that, compared with other 
approaches, our approach can achieve significant performance 
gains. 

Index Terms —multiuser detection, CFO compensation, OFDM, 
IDMA, space alternating generalized expectation-maximization, 
expectation-conditional maximization. 

I. Introduction 

T HIS paper investigates a channel-coded multiuser 
system operated with orthogonal frequency-division 
multiplexing (OFDM) and interleaved division multiple 
access (IDMA). While having many advantages, a major 
problem of OFDM-IDMA is the frequency asynchrony caused 
by the multiple carrier frequency offsets (CFOs) of the 
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signals simultaneously transmitted by the multiple users. 
We put forth a framework that jointly performs multiuser 
channel-parameter estimation, CFO compensation and 
channel decoding that addresses the multiple-CFO problem 
in a comprehensive and systematic manner. 

Why IDMA? 

For a multiuser system in which multiple users transmit 
simultaneously to a common receiver, IDMA is a technique 
for facilitating the separation of user signals at the receiver 
[ 1 ]. In IDMA, different transmitters interleave their channel- 
coded symbols in different ways before transmission to create 
orthogonality among user signals. In that light, it is similar 
to code-division multiple access (CDMA) except that CDMA 
creates semi-orthogonality with different spreading codes 
rather than different interleavers. Thus, it is not surprising 
that, as with CDMA, IDMA can provide multiuser diversity 
gains and mitigate inter-cell interference 0. 0- It has 
been shown that, all things being equal, IDMA outperforms 
CDMA in terms of error rate and receiver complexity [3]. 

Why IDMA with OFDM? 

This paper focuses on wideband communications. The 
channels in wideband communications are often frequency- 
selective because of multiple channel paths. The direct ap¬ 
plication of IDMA to wideband communications using time- 
domain signals leads to highly complex multiuser detectors 
jlj. This is because the multiuser detectors must deal with 
multiple-access interference (induced by multiple user signals) 
and inter-symbol interference (induced by multiple channel 
paths) at the same time. 

OFDM is a multicarrier modulation technique that can com¬ 
bat frequency selectivity in wideband channels. Specifically, 
OFDM divides a user data stream into many parallel sub¬ 
streams and transmits them over compactly spaced subcarriers, 
thereby converting a frequency-selective channel into a group 
of frequency-flat sub-channels j4]j. It is desirable to combine 
IDMA with OFDM, so that the multiuser detectors operating 
in the frequency domain only have to attend to multiple-access 
interference rather than both multiple-access interference and 
inter-symbol interference |5j]. 

A no less important advantage of OFDM for multiuser 
systems is that it can tolerate unaligned symbol arrival times 
among the multiple user signals at the receiver. Specifically, 
as long as the symbol arrival times of different user signals 
are within the cyclic prefix (CP) of each other, the signal 
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samples in the frequency domain are automatically aligned 
after the DFT 0. 

Major Challenges in OFDM-IDMA 

Despite its advantages, the OFDM-IDMA system is suscep¬ 
tible to multiple CFOs among the signals of multiple users. 
The multiple CFOs are caused by the different RF oscillators 
used at the different transmitters. CFO causes inter-carrier 
interference (ICI) among different subcarriers and induces 
cumulative phase drifts over the data frame (T|. 

Before the negative effects of CFOs can be alleviated, an 
issue is the estimation of channel parameters, which include 
the CFOs as well as the channel gains of different users. 
For multiuser OFDM-IDMA, the overall estimation errors 
may increase with the number of users. How to contain the 
estimation errors in OFDM-IDMA is a major challenge. 

With accurate estimates of CFOs, the next issue is to 
alleviate the negative effects of CFOs. A possibility is to 
attempt to compensate for them at the receiver. For single- 
user OFDM systems, the receiver can compensate for the 
single CFO by multiplying the time-domain signal (before 
DFT) with the complex exponent of the inverse CFO. This 
inverse operation cancels out the CFO. After that, the receiver 
performs standard channel decoding in the frequency domain 
to extract the source message. 

This method of separating CFO compensation and 
frequency-domain channel decoding, however, does not work 
for multiuser OFDM systems because of the multiple CFOs. 
Fundamentally, compensating for these CFOs simultaneously 
is impossible even if the CFOs were perfectly estimated 
without errors. This is because removing one of the CFOs 
in the received signal will necessarily leave behind some 
residual CFOs for the other CFOs, unless the CFOs of 
different users were exactly the same to begin with. As a 
consequence, the CFO-induced ICI inevitably remains in 
the frequency domain. Since the effects of all CFOs cannot 
be eliminated in one shot, we need an iterative method for 
multiuser joint CFO compensation and channel decoding for 
OFDM-IDMA systems. 

Our Approaches and Solutions 

We put forth a framework that jointly performs multiuser 
channel-parameter estimation, CFO compensation and channel 
decoding in an integrated manner. Our framework combines 
the space alternating generalized expectation-maximization 
(SAGE) (8) and expectation-conditional maximization (ECM) 
(9j algorithms. As far as we know, this is the first attempt to 
construct such a unified framework for OFDM-IDMA systems. 
Although SAGE and ECM algorithms are well known, many 
gaps, however, still need to be filled in order to build a 
complete and consistent framework with superior performance 
for multiuser joint channel-parameter estimation, CFO com¬ 
pensation and channel decoding. This paper fills such gaps, as 
elaborated below. 

The framework first employs the SAGE algorithm to de¬ 
compose the multisuser problem into multiple single-user 
problems, and iterates among single-user problems. Then the 
framework employs the ECM algorithm to tackle each of 


the single-user subproblems. Iterative executions of SAGE 
and ECM in the framework allow the two aforementioned 
challenges to be tackled in an optimal manner. 

After SAGE decomposes our multiuser problem into mul¬ 
tiple single-user problems, ECM solves the joint channel- 
paramter estimation, CFO compensation and channel decoding 
problem for each user. Here, a key element of our framework 
is to exploit the sum-product message passing algorithm (TO), 
EO for channel decoding to refine the channel-parameter 
estimation as well as the CFO compensation in an iterative 
manner. The sum-product channel decoding corresponds to 
treating the coded symbols as the hidden data in the ECM 
algorithm. We can also adopt anthoer message passing algo¬ 
rithm — the min-sum message passing algorithm (To), m 
— to perform channel decoding. We will explain in Section 
III.D that the min-sum channel decoding corresponds to a pure 
SAGE framework (rather than the SAGE-ECM framework). 
We will further show in Section IV of this paper that the 
proposed SAGE-ECM framework with the sum-product algo¬ 
rithm has better performance than the SAGE framework with 
the min-sum algorithm. 

To apply the sum-product algorithm in our framework, a 
subtle issue must be addressed. Specifically, the sum-product 
algorithm, when applied for channel decoding, yields soft 
information on the data symbols in the frequency domain. In 
our iterative framework, this data soft information will in turn 
be used to refine channel-parameter estimation. But the CFO 
estimation is only feasible in the time domain (this will be 
elaborated in Section III.C). Therefore, to use the data for CFO 
estimation, we need to first transform the data soft information 
from the frequency domain to the time domain. This problem 
can be treated as a soft IDFT problem: i.e., performing IDFT 
on probability functions. As will be elaborated, exact soft 
IDFT computation can be highly complex (of exponential 
order). To reduce complexity in soft IDFT, we adopt Gaussian 
message passing G3 to obtain approximate solutions. We 
show in Section III.C that Gaussian message passing reduces 
the complexity from exponential order to linear order. 

In our simulations, we show that our joint framework 
has around 5—8 dB SNR gain over conventional multiuser 
approaches for systems with 2—3 users. We further performed 
real experiments using software-defined radio (SDR) to verify 
our approach. The experimental and simulated results are 
consistent with each other. 

A. Related Works 

To our best knowledge, there is no such work that consid¬ 
ers the channel-paramter estimation, CFO compensation and 
channel decoding togather for multiuser OFDM systems. 

Multiuser decoding in IDMA is a method that jointly 
performs multiple-access interference cancellation and channel 
decoding (T), (2). Ref. (5) proposed the use of OFDM-IDMA 
for the multiuser communication. However, |5) assumed the 
absence of CFOs, and it directly applied the multiuser de¬ 
coding technique originally developed for IDMA 0, 0 
to OFDM-IDMA to deal with multiple-access interference 
in the frequency domain. Without CFOs, the application of 
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IDMA in the frequency domain is essentially the same as 
that in the time domain without multipath. As mentioned 
earlier, unavoidable multiple CFOs can cause both inter-carrier 
interference among subcarriers of the same user and among 
subcarriers of different users. Our work here investigates this 
fundamental yet practical issue. 

Subsequent to (13) and |T4) considered CFOs in 
OFDM-IDMA. The methods of |13), p4| cancel both inter¬ 
carrier interference (induced by CFOs) and multiple-access 
interference, and iterates between interference cancellation and 
channel decoding. However, perfect knowledge of CFOs and 
channel gains were assumed with no consideration given to 
their estimation. The methods of G3. 03 cancel both inter¬ 
carrier interference (induced by CFOs) and multiple-access 
interference, and iterates between interference cancellation 
and channel decoding. Our simulation results in Section IV 
indicate that imperfect channel-parameter estimation using 
preambles and pilots may cause significant performance penal¬ 
ties (more than 10 dB) to the system. This motivates us 
to improve the accuracy of channel-parameter estimation for 
OFDM-IDMA using not just the preambles and pilots, but also 
the data symbols in the signals. Doing so requires iterations 
between the channel-parameter estimation in the time domain 
and channel decoding in the frequency domain. 

An alternative multiuser scheme to OFDM-IDMA is orthog¬ 
onal frequency-division multiple access (OFDMA) uplinkPl 
As in OFDM-IDMA, the multiple CFOs in the OFDMA 
uplink cannot be compensated for in one shot at the receiver. 
Thus, in the presence of multiple CFOs, the user signals of 
OFDMA uplink will overlap in the frequency domain (i.e., 
these subcarriers are not strictly orthogonal due to the CFOs). 
The authors of 00’ 03 derived multiuser decoding methods 
for OFDMA to cancel the multiple-user interferences (induced 
by CFOs) and decode the data symbols in an iterative manner. 
As with the investigations of OFDM-IDMA in p3] , |14|, the 
investigations of OFDMA in |15j|, fl6) did not consider the 
impact of imperfect CFO and channel estimations. 

Recently, 03 proposed a method for multiuser joint 
channel-parameter estimation, CFO compensation and symbol 
detection for OFDMA. As with our current work, the method 
of 03 a ^ so a > ms t° combine SAGE and ECM for solving the 
problem. However, 03 did not consider the impact of channel 
coding, which necessitates a total recast of the algorithmic 
framework. In this work, we explore the proper way to 
combine SAGE and ECM for channel-coded OFDM-IDMA. 
The method of (T7) can also be modified for application to 
channel-coded OFDM-IDMA in a straightforward manner. We 
will show that a simple extension of the method of G3 to the 
channel-coded OFDM-IDMA leads to a worse performance 
than our method. 

The rest of this chapter is organized as follows. Section II 
describes our system model. Section III presents our frame¬ 
work and shows how to apply SAGE and ECM to OFDM- 

*We remark that in the OFDM-IDMA system of interest to us, all the 
subcarriers are used by all users. This is different from OFDMA, where 
different users use non-overlapping subcarriers. Multiple-access in OFDM- 
IDMA is achieved by means of user-specific interleaving in IDMA. OFDM- 
IDMA has better spectrum efficiency than OFDMA. 


IDMA for multiuser joint channel-parameter estimation and 
channel decoding. Section IV details our the simulation and 
experimental results. Section V concludes this paper. 

Notations: We denote matrices by bold capital letters, 
vectors by bold small letters, and scalars by regular letters 
throughout this chapter. All vectors are column vectors. The 
( i,j) th entry of matrix A is denoted by [A]—. In addition, 
A 1 , A h , A _1 and det (A) denote the transpose, the conjugate 
transpose, the inverse and the determinant of A, respectively. 
Re(-) means the real part of one complex number, and 
Z (•) is the angle of a complex number. CAf (x : m, K) = 


exp 


— (x — m) K 1 (x 


denotes the prob- 


7r r det(K) 

ability density function (PDF) of an r-dimension complex 
Gaussian random variable x with mean vector m and covari¬ 
ance matrix K. The Euclidean norm of a vector x is denoted 
by ||x||. Finally, © denotes the exclusive-or operation. 


II. System Model 

A. Transmit Signal 

Let us first look at the transmit side of our system model. 
In the uplink, U users transmit simultaneously to a base 
station. The transmitted messages employ OFDM signaling 
with IDMA. 

Fig. □ shows the block diagram of the system. User 
u, u £ {1,2,■■•,[/}, generates a sequence of J in¬ 

formation bits h u = [b Ut ib Uj 2 • • • b U: j] T . The information 
bits are then channel-coded into J/R code bits c u = 
\c u a c v .‘2 ■ ■ ■ c u j/r\ , where R is the code rate. We assume 
all users adopt channel codes that is amenable to decoding 
by the message passing algorithm G3, (nj. After channel 
coding, a user-specific interleaver permutes the sequence of 
code bits c u into an interleaved sequence of code bits c„ = 
[cu,iCu ,2 '• 'C u ,j/r] ■ The user-specific interleavers together 
with the channel encoder serve as the signatures of the users. 
Then, c u is modulated to a sequence of complex data symbols 
z„ = \z Ui iZ Ut 2 ■ • • z u ,j/rb\ , where B is the number of 
code bits per complex data symbol. For simplicity, we focus 
on BPSK modulation in this work. The extension to higher 
order modulation under the framework of bit-interleaved coded 
modulation (BICM) fl8| is straightforward. 

The complex data symbols are transmitted by means of 
OFDM signaling. OFDM transmits signal on a block-by- 
block basis. Each OFDM block contains N subcarriers. User 
u maps each element of z„ to a subcarrier. We denote the 
m th OFDM block of user u by a length-X vector X u m = 
[Xu, mi iX Utmi 2 ■ ■ ■ -V„ jmjAr ] T , where the i th element 
i £ {1, 2, • • •, A}, is the symbol transmitted over the i th 
subcarrier in the m th OFDM block of user u. 

For channel-parameter estimation, preambles are added to 
the beginning of the frame, and pilots are carried on selected 
subcarriers of every OFDM block. In our system, the first 2U 
OFDM blocks at the beginning of the frame are preamble 
blocks. User u transmits two identical training blocks over 
preamble block 2u — 1 and 2 u, and nulls its transmission over 
the other 2U — 2 preamble blocks. Thus, the preambles of 
different users are mutually orthogonal in the time domain. 
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Fig. 1. Block diagram of OFDM-IDMA. 


We stack the OFDM blocks of the overall frame of user u into 
a length-MiV vector X u = [X^X^ 2 ■ ■ • X ^ M ] T , where M 
is the number of OFDM blocks contained in one frame — the 
first 2U blocks are preambles and the next M — 2U blocks 
are data payload. 

We employ the following arrangement for the data and pilot 
subcarriers. The data subcarriers allocated to different users 
overlap completely. The index set of the subcarriers allocated 
to user data is denoted by Id- The pilot subcarriers allocated to 
different users do not overlap. The index set of the subcarriers 
allocated to user it’s pilots is denoted by Ip u with l r>d i\v = 
0 for Vm and Ip u ft /p jt) = 0 for u ^ v. User u maps its 
complex data symbols z u to data subcarriers according to: 
X U} m,i = z u ,i(m,i) for * £ Id, 2Z7+1 < to < M and 2 U +1 < 
to < M, where l (to, i) is the index of the complex data 
symbol assigned to the i th subcarrier of the m th OFDM block. 
User u fixes the known BPSK symbols on its pilot subcarriers 
to 1: X Ujm i = 1 for i £ Ip, u , 2 U + 1 < m < M and 2U +1 < 
to < M. On the other users’ subcarriers and the guard-band 
subcarriers, user u transmits dummy null symbols:X„ im) j = 0 
for i ^ Id U Ip >u , 2.U + 1 < m < M and 2 U + 1 < in < M. 

We denote the overall function that includes channel en¬ 
coding, interleaving, modulation, subcarrier mapping, pilot 
insertion and preamble addition for user u by C u , and we 
write X u = C u (b„) to express that the OFDM symbols in 
X„ are mapped from the information bits b u . 

The OFDM modulation is implemented by an N point IDFT 
^-u,m — F where x u m = 

is the vector of time-domain samples, and F is the 
N x N DFT matrix whose (p, q) entry is given by 
e -j2n(p-l)(q-(l+N/2))/N I 1 < p, q < N, 1 < p, q < N. 
We stack all the time-domain sample vectors of the whole 


frame into a length-M./V vector x u = [xy xJ 2 • • • xj M ] T ■ 

To overcome the delay spread of multipath channels, which 
causes inter-block interferences (IBI), the time-domain sam¬ 
ples of each OFDM symbol is preceded by a cyclic prefix 
(CP). We denote the length of the CP by N cp . Therefore, 
each OFDM block includes N s = N cp + N time-domain 
samples where Xu^m^ — for i £ 

{—Ncp + 1, • • •, 0}. The time-domain samples are converted 
to signal waveform via a digital-to-analog converter (DAC). 
The U users simultaneously transmit their signal waveforms on 
their respective multipath channels. The base station receives 
their overlapped signal waveforms. 

B. Channel Model 

We denote the overall discrete time-domain channel impulse 
response of user u that captures the effects of both the 
physical channel and the transmit/receive filters by h u = 
[h u .ih Uy 2 '' • /iti.ij 1 , where h u ^i is the I th discrete tap of 
the multipath channel of user u, l £ {1, ■ ■ ■, L u }, and L u 
is the maximum channel delay spread. We assume that the 
channels remain static over the transmission time of one frame. 
This assumption is valid in scenarios where users exhibit 
low mobility G£). We model the channel taps as mutually 
independent complex Gaussian variables with zero mean and 
an exponentially decaying power delay profile: E ||/iu.z | 2 j = 
j3exp{—{l — l)/L u ), where j3 is a normalization factor to 
ensure that the average channel energy is one. 

C. Receive Signal 

The duration of a time-domain sample is denoted by T s . The 
the timing mismatch between user u and the base station is 
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denoted by r„. For convenience, we define the zero reference 
time as the first-channel-path arrival time of user 1 (ti = 0 ), 
and assume without loss of generality that the first-channel- 
path arrival times of all other users are later than that of 
user 1 (t u > 0, u £ {2}). Following [ 171, j20) , we 
decompose t u into an integer part plus a fractional part with 
respect to the sample duration T s : t u = (/ i u + S u ) T s , where 
/i„ = Y t u/T s \ is the integer part and S u = t u /T s — is the 
fractional part. The fractional part can be incorporated into the 
channel impulse response, thus we will not consider it going 
forward. We can avoid IBI in the system by assuming that 
the system satisfies a loose time synchronization requirement, 
specified as max M {/i u + L u } < N cp . As long as this require¬ 
ment is satisfied, symbol misalignment in the time domain 
does not affect the subcarrier-by-subcarrier channel decoding 
in the frequency domain other than introducing relative phase 
offsets between the signals of the users — in particular, 
there is no symbol misalignment in the frequency domain and 
signals on different subcarriers are isolated from one another. 
Note that there is no need to explicitly estimate the timing 
parameters {r u }^ =2 . To meet the loose time synchronization 
requirement, the base station can broadcast a downlink beacon 
to prompt the users to transmit together ©, (T7), ( 2 TJ. 

The presence of CFOs destroys the perfect orthogonalities 
among subcarriers. A main focus of this paper is the study of 
the effect of CFOs (i.e., the frequency asynchrony problem). 
After analog-to-digital conversion and removal of the first N cp 
CP samples (counting from the first sample of the first channel 
path of user 1) of every OFDM block, the received discrete 
time-domain samples at the base station can be expressed as 


is given by 


r m = ^2 e J 9 ^ 

U= 1 


r (e u ) F H F) (X u , m ) Fh u + n, 


(1) 


for m = 1 , 2 , 


■, M, where 


subcarrier space) 

u\ T(e„) = 


, A 

h„ = 


o'! 


h T n T 

’ n u 1 u AT-L„-p„ 


n T 


R, 


u 

= E e 


FT (e u ) F h D (X. 


u 

= E e 

u—1 
A 





jBu 


1 3 (£u) D (X-u,m) H„ + N, ; 


( 2 ) 

where H„ = Fh,, is the vector of the frequency channel 
responses for user u, N m = Fn m is the vector of frequency- 
domain noises, S (e u ) = FT (e u ) F H is the matrix that 
transforms the effect of CFO e u in the time domain to ICI in 
the frequency domain. It can be shown that S [£ u = 0) = I; 
3 (t„ 7 ^ 0) = AI + II (e u ^ 0), where A is a common 
attenuation factor across all subcarriers, and II (e u f 0) is the 
ICI component |22j. 

We stack the M length-A' time-domain (frequency-domain) 
sample vectors into an overall length -MN sample vector 


A 

r = 


„T_T 


L MJ 


(R = Rf R 


■ R^j] ). The receiver 


r m = [r m ,ir m ,2 ■ ■ ■ r m ,N} is the vector of the N 
discrete samples of the m th received OFDM block; 
n m = [n mj in mj 2 ■ • • n mj jv] T is the vector of the complex 
white Gaussian noises with zero-mean and variance of 

‘i. 


e u is the CFO (normalized to 
between the base station and user 
diag { [l, e j27r£ “/ jV , • ■ ■, e j 27 rE “( JV ” 1 i/ JV ] } is the diagonal 
matrix that captures the time-domain effect of CFO on 
the m th block of user u\ 

Q u ,m = ^Tt£u{N cp + mN s )/N is the accumulated phase 
drift caused by CFO e u on the m th block of user it; 

D (X MjTO ) = diag (X„ im ) is the diagonal matrix with 
transmitted frequency-domain symbols X MjTO as its diag¬ 
onal elements; 


is the length-A vector that 
captures both the discrete CIR and the time asynchrony 
of node u. 

For OFDM systems, channel decoding is performed in the 
frequency domain. The frequency-domain sample vector of the 
m th OFDM symbol, R m = [R m ,iR m ,i ■ ■ ■ Rm,N] T = Fr m , 


then undertakes two tasks: (i) estimation of channel param¬ 
eters {£ u ,iiu}u=y ® decoding of multiuser information bit 
sequences {b „}„ =1 . 

With reference to the signal model in (|2]), we can observe 
that CFO causes two negative effects on the frequency-domain 
signal: (i) the drifting of the signal phase over time; (ii) the 
inter-carrier interferences (ICI) among different subcarriers. 

For conventional point-to-point OFDM systems, the receiver 
can first estimate the CFO and the channel gain from the 
preamble and then compensate for the CFO for the whole 
frame in the time domain. That is, it attempts to remove the 
CFO in the signal using the estimated CFO. After that, it 
performs standard channel decoding in the frequency domain. 
Estimation error may leave behind an uncompensated residual 
CFO. If the estimation is accurate enough, the residual CFO 
will be small, and the remaining CFO-induced ICI will also 
be small. We can treat the residual ICI as additional noise 
that lowers the effective sieual-to-noise ratio (SNR) slightly 
0 , but standard channel encoding/decoding can still be em¬ 
ployed to ensure communication reliability |23) . This CFO 
compensation alone, however, cannot overcome the drifting 
phase. Even a small residual CFO can lead to large phase drifts 
accumulated over time. Standard channel decoding will fail if 
we ignore the phase drifts. Therefore, point-to-point OFDM 
systems typically employ pilot subcarriers in the OFDM block 
to track signal phase so that it can be corrected. G9- 

The above design principle, however, does not work for the 
multiuser OFDM-IDMA system. In particular, there are mul¬ 
tiple CFOs in OFDM-IDMA, one for the signal of each user. 
First of all, accurately estimating these CFOs and the channel 
gains is demanding. More fundamentally, compensating for 
these CFOs simultaneously in one shot is impossible even if 
the CFOs could be estimated without errors (by contrast, for 
conventional point-to-point systems, total CFO removal is pos¬ 
sible given perfect CFO estimation). This is because removing 
one of the CFOs in the received signal will necessarily leave 
behind some remaining CFOs for the other CFOs, unless the 
CFOs of different users were exactly the same to begin with. 
As a consequence, the CFO-induced ICI inevitably remains in 
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the frequency domain. Instead of channel-parameter estima¬ 
tion, followed by CFO compensation, followed by channel 
decoding, that are typical in many single-user systems, an 
iterative method is called for in a multiuser system such as 
OFDM-IDMA. In what follows, we propose a SAGE-and- 
ECM framework to address the joint problem of multiuser 
channel-parameter estimation, CFO compensation and channel 
decoding. 

III. Multiuser Joint Channel-Parameter 
Estimation, CFO Compensation and Channel 
Decoding 

To improve the overall performance of the OFDM-IDMA 
system, we make use of the preambles, the pilots as well as the 
data payload of the received signal to jointly estimate channel 
parameters and decode information bits. 

For OFDMA receivers, ED developed an iterative solution 
that uses SAGE and ECM. However, ED did not consider 
the impact of channel coding, the good performance of which 
necessitates a total recast of the algorithmic framework. In this 
work, we explore the proper algorithmic framework of SAGE 
and ECM for channel-coded OFDM-IDMA systems. We argue 
that incorporating channel decoding by simply extending the 
framework of ED leads to subpar performance. 

We formulate our target problem as a multidimensional 
estimation problem in Section III.A. Section III.B gives an 
overview of a SAGE-based signal decomposition method. As 
in 03’ we apply SAGE and break down the overall problem 
into U sub-problems, one for each user, by decomposing the 
received signal into U signal components. Readers who are 
familiar with SAGE decomposition can quickly go through 
Section III.B to familiarize themselves with our notations. Sec¬ 
tion III.C zooms in to the ECM-based joint channel-parameter 
estimation, CFO compensation and channel decoding problem 
within the U sub-problems. The main contribution of our work 
is contained in Section III.C. In particular, a key issue is how 
to incorporate message passing algorithm for channel decoding 
into the joint framework. In that regard, two message passing 
strategies are possible for channel decoding: sum-product and 
min-sum message passing G9> 03- In our framework, the 
ECM algorithm gives the sum-product channel decoding. We 
show that the integration of sum-product channel decoding 
with channel-parameter estimation leads to better system 
performance than min-sum channel decoding. However, for 
compatibility with the channel-parameter estimation part, the 
sum-product channel decoding cannot be applied directly: a 
transformation of the soft information from the time domain to 
the frequency domain is needed. We develop a new technique 
called soft IDFT to realize an overall compatible algorithmic 
framework. Exact computation of soft IDFT, however, is 
complex. Here, we obtain an effective approximate solution 
by Gaussian message passing. 

A. Problem Statement 

As expressed by the signal model in 0. the only un¬ 
known channel parameters are the CFOs {e u }^ =1 and the 
channel gains {h u }^ =1 . In (JTJl, the phase drift 0 urn = 


2tt£ u (N cp + mN s )/N is not independent and is a function 
of CFO e u . However, if we only estimate e u and derive the 
estimated phase drift d u m based on the estimated e u , even a 
tiny estimation error in e u will accumulate to large estimation 
errors in for later blocks m. Furthermore, the expression 
0 u ,m = ‘2n£ u (N cp + mN s )/N assumes that the signals of 
different users do not incur different phase noises and the 
phase d u m is strictly due to that of the CFO. In practice, 
the phase drift is not due to CFO alone but also due to 
phase noise that behaves like a random walk |24j. Thus, in 
this paper, for a robust system, we do not make use of the 
expression d Uy7n = 2 tte u (N cp + mN s )/N when estimating 
we assume that {0„ m } are independent for different 
OFDM blocks and different users, and estimate 0 u _ rn of each 
block independently. The advantage of this scheme is that 
phase errors due to estimated CFO errors are not cumulative 
over blocks and random phase noise can be taken into account. 

With the above, the overall unknown variables in the system 

aie *! E U) rn=l ’ X u f 

l. . ) w= 1 

According to the maximum likelihood (ML) principle, we 
can express the objective of the joint channel-parameter esti¬ 
mation and channel decoding problem as 


&ui i @u,m r j b u , 

t J m= 1 


X, 


u =1 i 


= arg max 


logp r 




= arg mm 

{eu,{0«,m}" =1 ,h„,XeC u }^ =l 


M 

E 

m =1 


I’m - E & 

U—l 


jOu 


r (e u ) F h D (X Ujm ) Fh ( 


(3) 

With regard to 0 we emphasize that since the sequence 
of transmit symbols is generated from the sequence of 
the original information bits via a one-to-one mapping, 
i.e., {X.„ = C u (b u )}^ =1 , decoding the transmit symbols 
{X u }^ =1 is equivalent to decoding the information bits 
{b u } u==1 . Directly solving the ML problem ([3|> is intractable 
because the exhaustive search ove f i; the multi-dimensional 
space of i e u , {0 Ut m}™ =1 , h u , X u i is prohibitively com- 

y u —i 

plex. 


B. Preliminary for the Signal Decomposition Using SAGE 

In |25) , the authors solved the problem of multiple param¬ 
eter estimation (but not channel decoding) using a iterative 
method. Later on, the method of |25) evolved into the SAGE 
algorithm |8j. The key idea is to decompose the received over¬ 
lapping signal into several signal components. Subsequently, 
E3 applied the method of |25) to OFDMA systems. Since the 
signal decomposition is performed in the time domain, where 
the user signals overlap completely (for both OFDM-IDMA 
and OFDMA), we can directly apply this SAGE-based signal 
decomposition to our OFDM-IDMA system. This subsection 
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extends the SAGE signal decomposition in ]25) to incorporate 
channel decoding. 

First, the overall estimation problem (Rli is decomposed into 
U sub-problems [17|, [251. For the mr 1 OFDM block, the 
signal component of user u is defined to be 


r e o9u 

L u.m — 0 


r {e u )F H D (X„ >m ) Fh„ + n u 


(4) 


where {n U|m } j are obtained by arbitrarily decomposing 
the total noise vector n TO into U circularly symmetric and 
statistically independent noise component vectors that satisfy 
n m = Eu=i n u,m [251. The received signal can then be 
written as 

E u 

Yu,m (5) 

U—l 

where r m is decomposed into U components , with 

each being exclusively related to one user. As with the stacking 


of the overall received signals r = [rf ■ ■ ■ i'X/J i n the 
whole frame, we can also stack the M signal component vec- 


l u,Mj 


tors of user u into an overall vector r u = [r^ 1 2 

u £ {1,2,---, C/}. In the terminology of SAGE, r is the 
observed data and {r u }^ =1 is the complete data. 

SAGE tries to find jjte ML estimates 


{£u, {0u,m}Z=l, h u ,X„} 


ck k ) 

t-U 


(n( k) \ 
5 1 Vu,m r 


M 


U/, , -**-u 


w 


5=1 


iteratively 


for 

Let 


be the updated estimates 


after the k th SAGE iteration, where k = 1,2, ■■■K and 


s(o) 


/ 2 I°) 1 

i \ u u,,m r 


M 


:(o) ^(o) 


are the initial estimates. 


The initial estimates for the CFC)s and the channel gains are 
obtained from the orthogonal preambles; the initial estimates 
for the phase drifts are set to zeros; the initial estimates for 
the transmit symbols are set to ±1 randomly. Each SAGE 
iteration consists of U cycles; the variables of user u are 
updated in the u th cycle given that the variables of all other 
users are fixed to their last estimates. 


With 


the 


initial 


estimates 


U°) (m \ 

S tu , S v u ,m 1 


M 


;(o) o(o) 


the SAGE algorithm 


first computes the initial estimates for the individual user 
signal components: 


0 ( 0 ) =e j§i o) m 

u.m 0 


( 4 °)) F b D (x ( u 0) ) Fhl°) (6) 


where m = 1,2, • ■ •, M and u £ {1,2, • ■ •, U}. The u th cycle 
of the k th SAGE iteration includes an E-step and an M-step 
as follows; 


E-step of SAGE: 

Compute the tentative estimate for the signal component of 
user u: 


u-l u 

f( fc ) =r r (fe_1 ) ( 1 ) 

u,m m /j v,m / j v,m 
v—1 v—u-\-l 

2 This means that the setup of SAGE here treats all variables { e u , 
{Ou,m}m =1 ,h u ,X u | , including {X u }^ =1 as the parameters; there 
is no hidden data in the setup 0- £3 


where m = 1, 2, • ■ •, M, and = 0 if u < v. Note that this 
computation is based on the signal decomposition in (|5j. We 
stack the estimates for all the OFDM blocks into an overall 

„ 1 T 


. -(fc) A 

vector r„ ' = 


9 (fe)T 9 (fe)T 
l u, 1 l u, 2 


? (k)T 

l u,M 


M-step of SAGE: 

Update the estimates for the variables of user u: 


£^k) f 7j(k) 1 r(fc) 

l J m—1 


( k ) 


X 


= arg max 

(E„,{8u,m}" =1 ,h„,Xu£C„) 


log P S u , {O u ,m}m=l > h u, X u j | 


= arg mm 

(e„,{0„, m }Jf =1J h u ,x u ec u ) 

M 


E 

m—1 




r( £u )F ff D(X u , m )Fh, 


( 8 ) 

After the M-step, we then reconstruct the estimate for the sig¬ 
nal component of user u using the updated variable estimates 


0 ( 0 ) =p j9i k l T 

L u.m 0 


(£< fc )) F h D (xl fc )) Fht fc ) (9) 


for m = 1, 2, • • •, M. This completes the u th cycle of the k th 
SAGE iteration; we then proceed to the cycle of the next user. 
After the variables of all users are updated to 

l^u\\o i £,li\ , x . ( u k) | , we then proceed 

l l J m—1 J u= 1 

to the next SAGE iteration. When the number of iteration 
k reaches a preset maximum limit K, we terminate the 
SAGE algorithm after obtaining the final variable estimates 

^ ,h-E 1 .X, ( E| . According to the 

theory of SAGE (8), it is expected that the final variable 
estimates will converge to the global optimal as required by 
the ML estimation in (|3j. 

As seen above, SAGE decomposes the multiuser problem 
of joint channel-parameter estimation and channel decoding 
in 0 into U single-user problems in 0- The complexity 
of the multiuser problem is reduced substantially. However, 
the computation involved in the single-user sub-problems as 
expressed in ([8]> is still non-trivial. For each user, we need 
to solve a multi-dimensional problem associated with simul¬ 
taneously estimating je„, , h u , X u j. This is of 

high complexity, particularly if the data symbols {X„}^ =1 are 
channel-coded symbols and we want to exploit the correlations 
among the symbols induced by channel coding to optimize our 
estimation. 

A simplified approach to solve ([8J is to estimate the vari¬ 
ables in |e u , {0u,m}m-i ’ X u j- one at a time sequentially 
and iteratively. When one variable is under estimation, all other 
variables are fixed to their estimates from the last iteration. 
This approach is straightforward with an important caveat: it 
does not use the information obtained from channel decoding 
in an optimal way. We will further elaborate on this simplified 
approach in Section III.D and will treat it as a benchmark for 
evaluating our approach to be presented in Section III.C. To 
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make better use of the information from channel decoding. 
Section III.C will construct a more comprehensive approach 
to our problem using the ECM algorithm. 


C. Joint Channel-Parameter Estimation and Channel Decod¬ 
ing Using ECM 

ECM is a variant of the expectation maximization (EM) 
algorithm, a general iterative algorithm for finding the ML 
estimates of parameters in a statistical model with hidden data 
|26 |. EM updates all parameters in the model simultaneously 
at each iteration. This requires EM to operate in a multi¬ 
dimensional space. To reduce complexity, ECM updates the 
parameters sequentially. At each stage of the update sequence, 
ECM updates just one parameter, fixing the other parameters 
to their last estimates ©• 

Within the framework of ECM, we could assign the role 
of hidden data and the role of parameters to the variables 
{e«, , h„, X.A in different ways, each yielding a 

different implementation. In that sense, it is more general than 
0 , in which all variables are treated as parameters. 

In this paper and the rest of this section, we focus 
on the assignment that treats X u as the hidden data, and 
je„, {6u,m}m=i > as the parametersJ^J With this assign¬ 
ment, we can incorporate sum-product channel decoding into 
the joint framework. Now, ECM seeks to solve the following 
maximization problem: 


A k ) 

tu 


5 i u u,m i 


M 

m—1 


:(k) 


= arg max 

= arg max 


logp fi 
=1> h„) v 


:(k) 


1 ; h-u 


log X] P (ri fc) , X u £ u , {dr , h„, C u ) | 


( 10 ) 

Note that problem ([TO]) is different from problem ([8]) , where 
all variables 

treated as parameters to be estimated. 


{eu,{e u ,m}Z=l^u, X u | (including X u ) 


are 


After ECM finds 




5 1 u u,m i 
l ) m—1 


:(*) 


/ £ sq \ M hi — \ ,V ' 

\ u u,m j m= i » L1 u j — i ^Vu,m j 


'J 


\ in ( 

10 

J 

M 



(k) 


then set 

in the signal model (|2| and then perform sum-product channel 
decoding to find the a posteriori probabilities (APPs) of the 

Ak) f7){k) \ C(k) (' 

L u i i i Vu.m f i ll u 5 '-'i 

l J m—1 

for all m and i. For example, if the convolution code is 
used, the corresponding channel decoding algorithm is the 
BCJR algorithm j27). The estimates for the symbols in X„ 
are obtained by making hard decisions based on the symbol- 


coded symbol p X u rn 


3 The assignment that treats h u as the hidden data and 
e u , , X„ > as the parameters is discussed in Appendix 


wise APPs: 


(fc) 


X. 


= arg max p 

X u rn i 


X,, 


Ak) Ak) 

1 U t 


(hi*) \ 

i vu,m ( 


M 

m—1 




(fc) 


for all to and i into 


We stack all the symbol estimates X, 

^ (fc) ^ (k) 

a vector X,, , and X u is treated as the estimate for X„ 


Finally, the parameter estimates 


A k ) 
£> 


®) (A k ) \ 

5 S Uu,m r 


M 


:(k) 


(k) 

by ECM together with the hidden data estimate X„ ’ by 
channel decoding are treated as the overall solution for 


A k ) 
£> ' 


c) {g[k) X 

5 i Vu,m r 


M 


C(*0 v [k) 

, J1?A , -A-7I 


. We will see that the ECM 


and channel decoding can be integrated into one framework 


to assist each other for finding 


Ak) 


Afk) \ 

5 i u u,m C 


M 


:(fc) 


and 


^ (k) ^ (k) 

Xu . Here, we emphasize that X,, 1 are the decoded soft 

information (APPs). We will show in the following how to use 
the APPs to refine the estimates of the parameters through the 
ECM iterations. 

Within the k th iteration of SAGE, the ECM algorithm for 
solving each sub-problem in 0 consists of Z iterations. We 
collect the parameters into a set fl u = |e u , {0,, m }^ =1 , h„ |, 
and denote the updated parameter estimates after the 


z"'" ECM iteration within the k th SAGE iteration by 


n 


th 

( k,z ) 


= <si k 


, z ) 

-> Uu,m i 


M 


'(k,z) 


z = 1,2,- 




1 m=l 

The z th iteration of ECM within the k th SAGE iteration 
consists of an E-step and an M-step, as follows: 


E-step of ECM: 

Given the parameter estimates from the previous ECM it- 
eration f l y u ’ , the E-step of ECM aims to compute the 
conditional expectation defined by |9]. 


Q (0- v 




{k,z— 1 ) 


E p( 

Y \ 


X„ 


C-(^) r\(k,z — 1 ) ^ 

i u •) ^ “U 5 v 


log p (r ( u k) 


( 11 ) 

This conditional expectation is called the Q function in the 
EM literature. We remark that the decoded soft information 
of the transmitted symbols, if obtainable, is used to assist the 
estimate of the parameters. With this setup, the data symbols 
serve as the hidden data in the ECM framework. Since the 
expectation (the summation) in © is taken over the data 
symbols in the frequency domain, we might be tempted 
compute the above Q function in the frequency domain. 
However, we will see that this does not work. With a new 
approach, we can compute the Q function in the time domain. 


How to decompose Q function in time domain 

A brute-force attack that attempts to compute the Q function 
in © directly will meet the following obstacles: (i) the 
ensemble of codewords X u to be summed over in © 
is very large (in fact, exponentially large in terms of the 
number of symbols in X u )\ (ii) barring exhaustive enumer¬ 
ation, there is no known decoding algorithm that can give 
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r[ k \ vJa’ z 1 ■*, C'u j for all these codewords; (iii) there 

is no concise expression for p |Q U , X u ^ due to the 
coupled terms brought about by the ICI effect. Ref. flT) did 
not take these obstacles into account, since it investigated 
uncoded systems. In order to obtain a practical solution for 
eu- in this work, we transform the computation process to 
the time domain, where the Q function can be decomposed 
into many sample-wise factors. 

We define the frequency-domain compound symbol of the 
subcarrier symbol and the channel gain as 


y — X 


Given the channel gain H u i , Y v rn l £ {±} when BPSK is 
adopted. The vector of frequency-domain compound symbols 
for the m th OFDM block is 


Y u,m — [Lz,m,lF u,m,2 ' ' ' Y u,m,N\ — ditty H u , 


the corresponding vector of time-domain compound samples 

tS yu,m — [yu,m,iyu,m,2 ' ' ' yu : m,N ] — F Y u,m* and 

the vector of frequency-domain compound symbols (time- 
domain compound samples) for the whole frame is Y„ = 

[ y u,i y u,2 ■ ■ • y Zm\ t (y u = [ylpylp ■ ■ • y u,m\ 1 )■ Bearin 

mind that given the channel gains H„, the knowledge of the 
compound symbols Y u is equivalent to the knowledge of the 
transmit symbols X„. We can therefore rewrite the Q function 
in ([ 11 ]) as 






a 


( k,z— 1 ) 


gM 


^(k) f\(k,z — l) 

Tu , fiu 


C-(^) 1) /~i 

*-U j 


■)} 

logp |f 2 „,y u )j 


log p |fi„ 


Next, we decompose the probability functions in (12 1 to 
compute the time-domain Q function. We denote the i th 
(the estimate for the i th sample of the to 1 

u.m 


element of r „\ n 


th 


and decompose p ( r 


*(*0 


Yu J 


M N 


as 


m =1 i=l 


where the sample-wise factor is given by 

= CN : e j 0 "’™e j2ns ^ i - 1 ')/ N 


yu,m,i-> 


for all m and i. Substituting ( p~3| ) into ( | 1 2\ , we obtain 

Q(n u h i u k ' z ~ 1) )= EE iogp (ri%,i \^u,yu,m,i) 

' ' m— 1 2=1 y u ,m,i 

x E p(y«|f < tl fc) ,n^*- 1) ,c- tI ) 

(y 


=p(y u , m M k) 


(15) 

where {y u y u ,m,i} is the set that contains the all elements 


(fc) 5(fc,z-l) 


- U 7 Ylu 


,Cu) 


in vector y u except and p {y u , m ,i 

is the sample-wise APR We define the sample-wise Q function 
as 


Qm,i ^ 


a, 


SI 


(fc,z-l) \ A 


) 


E lo SP I &u,yu,m,i)p {yn 


(k) Q(fc,z- 1 ) 

U 5 i C'U ■ 


C u J 
(16) 

Finally, the overall Q function is the sum of sample-wise Q 
functions: 

M N 

q (n u \ni k ’ z ~v) =J2Y, Q"* ( n « ) (17) 

m= 1 2=1 

As can be seen from the above, unlike the frequency-domain 
Q function, the time-domain Q function can be computed 
on a sample-by-sample basis. This greatly reduces the 
computational complexity. 


How to transform APPs from frequency domain to time domain 
The next question is how to obtain the APPs of the 
time-domain samples |p (y Ui m,i 
in ©■ Now, the symbols are 
transmitted in the frequency domain; after DFT the 
receiver performs channel decoding to find the APPs of 


(fc) Q(fe,Z—1) \ ) 

iu : '-'u ) f 

channel-coded and 


the transmit 


symbols j p (x u ^i 


(fc) 5(fc,z- 1 ) 


L U 7 Yl u 


( 12 ) 

where the second equality is due to that the mapping between 
Y„ and y„ is a one-to-one correspondence. 


Given the estimated 


H 


(k,z— 1) £f(k,z—l) 

■tin. 9 


channel 

T 


gains H 


H, 


(k,z— 1) 
u,N 


=Fh 


(k,z—l) 


,c, 

(k,z— 1) 

this 


)}■ 

A 

is 


equivalent to finding the APPs of the frequency-domain 
compound symbols, i.e., p ^X„ j77V 

P (Yu,m,i , C u 1 where Y„ 

(k,z— 1) 


OFDM block of user u) by . Since CFO only introduces 
a linear phase drift in the time-domain signals (and not ICI), 

sJk) ^(k) 

we can immediately decouple the components T in r „ 


X, 


,h: 


c.(k) q(/c,z—1) \ _ 

1 IL j M'U 7 J - 


p (*u } l n «,y«) = II II p \ n u,yu,mp) (13) 


u,m,i-L-L u i for all to and i. The algorithm to achieve 
this decoding objective is the general sum-product algorithm 
©7 HD. 

There is no known technique, however, for decoding the 
time-domain samples {y u ,m,i}- In this work, we introduce 
a new concept called soft IDFT to transform the APPs 
obtained by the sum-product decoding algorithm from the 
frequency domain to the time domain. At first sight, we might 
be tempted to obtain p 


P Y u 


(k) Q(k,z- 1 ) 
U 


( Y 

\ A u t r 


;(*0 


q(/c,Z—1) ^ 

^7 


from 


C u ) by considering the linear trans- 

_ pffy 

u.m. 


( 14 ) 


formation between y Ujm and Y u m , i.e., , y„ 
and then deriving the APP of sample y u , m ,i by marginalizing 
out all other samples over p (y u , m rE, f 1 \C U \ There 
are two obstacles to this approach. First, it is hard to obtain the 
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APP of the vector of symbols p ^Y Ujm r i k \fli k,z 1 ^, C\ 
(the sum-product algorithm, for example, does not give this 
APP). Second, the aforementioned marginalization will intro¬ 
duce intractable complexity that is in the exponential order of 
the DFT size N. Here, we employ a Gaussian message passing 
approach to solve the soft IDFT problem approximately. 

We approximate the APPs of the frequency-domain symbols 
as independent Gaussian distributions. The independence can 
be achieved by the operation of interleaving in the transmitter. 
The Gaussianity is an assumption made to simplify computa¬ 
tion^] The approximate APP of Y u ^ m ^ is given by 




y;(k) q (&— 1 ) ( 1 


m Y u 


n U Y U 


CM ( Y u ,m,i ■■ 

nee 

— ST' y (y • ro(M-i) n \ 

" L Yu,m,i — ' J Y . u,rri ’ l * J \ U,771,1 L u 1 “ L U ) V -'14 J 


(18) 


with mean and variance 


u Y u , mti ~ 

- my„ m1 || 2 p (Y u ,m,i 


YM) t) 

1 U , 


which are computed using the APPs delivered from the 
sum-product channel decoding. Then, with the independence 
assumption, we can write the APP of Y u m as 


P ( Y u r 


(k) ^(fc,2-l) 


,c v 


N ~( 

~ n P ( Y u ,m,i 
= CM (Y u ,m ■ m Y u , m j Cy 

with mean vector and covariance matrix 

A r i T 

= l m Y l 


(k) A(fc,2-1) 


- U ; Cl u 


) 


Yu) 


(19) 


rriY, 

Cy u m = 


diag ^ i 


i m Yu 


■ m y- 


Finally, since y„ m = F" Y„ m and F" is unitary and Y v m 
is assumed to be Gaussian distributed, the APP of y um is 
given by 


p (y u , ro |r« ^- 1} , C u ) = CM (y u , m : 


with mean vector and covariance matrix 


m v 


( 20 ) 


= F H rriY„ 

= F h CyJ 


F 


distribution ]28) , and the APP of y u mi is immediately given 
by 

P ( yu,m,i c) = CM (y Uim ,i : 


( 21 ) 


with mean and variance 


N 


= [ m y u , m ]i = 7m T, ° j27r(i 1)(j 1)/N m Yu 


j=i 


We will see later that when we use the Gaussian form of the 
APP shown in 0 for computing the Q function, only the 
mean m Vu m . has impact on the actual form of the Q function 
and the variance cr 2 u can be dropped. Since the mean of 
Vu,m,i is easy to compute because it is a linear combination 
of the means of the complexity of soft IDFT is 

reduced to the linear order of N by Gaussian message passing. 


How to obtain frequency-domain APPs 

We have considered how to obtain the time- 
domain sample-wise APPs from the frequency-domain 
symbol-wise APPs, given that the symbol-wise APPs 


P 


X u 




(k) Q(k,z — 1) \ \ 

L u A Lu , I 7 


are already 

computed by the sum-product channel decoding algorithm in 
the frequency domain. 

To compute the symbol-wise APPs in the frequency domain, 
we need the estimates for the frequency-domain signals of 
user u. We have already computed the estimates for the time- 
domain signals |r u!r„,j as in (b i. Before transforming Yuln | 
into the frequency domain, we compensate for the CFO using 
the CFO estimate s,y z ' from the last iteration. Then, we 
perform DFT on the compensated estimates for the time- 
domain signals 


= F ( r ( 


_Yk,z- 1) 




for to = 1, 2, • • •, M. After that, we can obtain the evidence 
information from | Ri/'m | 

■)(k’Z~ 1) A 


P 


Y O'- 

“ L u 


( 7?^) 

l - rt u,ra, 

CM 




X 


u,m,it @ 1 N 


( 22 ) 


where i = 1,2, ■ ■ ■, N, m = 1,2 
>( k ) 


. M R (fc) ■ 


is the i th 


j N is the variance of the residual 


A nice feature of Gaussian distributions is that every marginal 
distribution of a joint Gaussian distribution is itself a Gaussian 

4 Without the Gaussian assumption, we can solve the soft IDFT 
problem on a factor graph using the general message passing algo¬ 
rithm. We first construct the factor graph by considering the butterfly 

graph of FFT. With (p (Y Utm}i |ri*^, , C u ) ) as the in- 

put messages, we use sum-product rule to compute the output messages 
| p (y u ,m,i 1} ,C)} exactly. However, the number of mes¬ 

sages on the graph increases in an exponential order as we progress from the 
input to the output of the IDFT factor graph (i.e., unlike the binary Y u ,m,i> 
Uu,m,i has 2 n possible values.). We will treat this problem in our future 
study. 


element of Rn,m, and a 
interference plus noise. The residual interference remaining in 
| R^m | includes the residual inter-carrier interference and 
multiple user interference. The variance aj N is an unknown 
variable whose value is changing over the iterations. Before 
channel decoding, we employ a simple method for estimating 


cT j N in each iteration 
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The estimate aj N is used to replace cr 2 
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compute the evidence information. 


in ( 221 when we 
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The evidence information computed above is used to 
initialize the standard sum-product algorithm for channel 
decoding. We can derive the sum-product channel decoding 
algorithm as a message passing algorithm on the factor graph 
that models the encoding constraint C u . Readers familiar 
with the sum-product algorithm and factor graphs GJ can 
readily complete this task; we omit the details here. 


The First Stage — CFO Estimation 

The first stage updates the CFO with the phase drifts and 
the channel gains fixed to their last estimates. The new set 

XN (k Z _ 1 | — ^ 

of parameter estimates after the first stage is 3 ' = 


Ak,z) ('%k,z-l)\ M r(fc, 
c-n , S Vu.m r i ll u 

l ) m—1 


■- 1 ) 


is obtained by 


. The new CFO estimate 


Ultimate Form for Q Function 

Finally, using the Gaussian expression of the sample-wise 
APP shown in 0 - we can simplify the sample-wise Q 
function into a compact form (the derivation is given in 
Appendix A): 
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(29) 

where the objective function is the result of replacing the 


variables {0 Ujm }^ =1 and h u in the Q function (28 1 with the 


(24) 


i M 


{ 0^u,m ^ ) and h 

l J m=l 


(k,z— 1) 


From (24 1 , we note that, due to the Gaussian expression for 


the APP of Du^m^i in (21 1 , only the mean of the time-domain 
sample y u ,m,i appears in the sample-wise Q function. This 
greatly reduces the complexities involved in the computation 
of the Q function. With ( |24| , the overall Q function can now 
be written as 


fixed values 

The exhaustive search method for solving (|29j) is computa¬ 
tionally complex, since e u is a continuous variable. To obtain 


a practical solution for (291, we could approximate T {e u ) in 


(291) using its Taylor expansion around E* ’ 2 11 with terms 


above the second order truncated. We then differentiate the 


resulting objective function of (29 1 with respect to e u and set 
the derivative to zero. Solving the equation yields a closed- 
form solution for 


_ sfM- 1 ) 

&u — &U 

f M . s (k,z- 1) 
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(25) 

We denote the mean vector of the symbol vector X„ m 
b y m x u , m = K^ mtl ffljf, imt 2 -mx Vimi)f ] , whose i th 
element is computed using the symbol-wise APP 
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where 


r 

r" 


' (el fe ’ z " 1) ) = 0‘2 t t/N) 

(e i*’ 2_1) ) = ~(2n/N) 2 ^ 2 T 


With the above notations, we have the following relationship: 

m yu m = F ff m Y „ im = F ff D (m Xu m ) Fh u (27) 

Substituting ( |27] ) into ( (25) ) gives the ultimate form of the Q 
function: 


(28) 

So far, we have finished the E-step of ECM. We next turn to 
the M-step of ECM. 


(26) with d/ = diag {[0,1, 2, • • •, N — 1]}, are the first and 
second derivates of T (e u ) at the point E *’ 2 E The detailed 
derivation of @ can be found in Appendix B. With the 
updated CFO estimate, we go to the second stage of the 
M-step. 

The Second Stage — Phase Tracking 

The second stage updates the phase drifts with the CFO 
and the channel gains fixed to their last estimates. The 
new set of parameter estimates after the second stage is 

(fc, 2 -l+f) _ JEfc.z) 
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new phase estimates are given by 
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M-step of ECM: 

The M-step of ECM updates the new parameter estimates 
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by maximizing the 


Q function in (28 1 . ECM breaks the maximization procedure 
of the z th iteration into three stages, where the t th stage 

j 1 r -3—) Fi( k,z — 

updates the parameters from ' to ilf , 

t = 1 , 2 ,3, as follows: 


= arg max 
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J Fh 
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The objective function in (|3 1 [) is obtained from the Q function 


in (1281) e u = ei fe,2) with h u = hi*’" E 


Since we assume that the phase drifts [ =1 are 

independent for different blocks, we can decouple the problem 







































FREQUENCY-ASYNCHRONOUS MULTIUSER JOINT CHANNEL-PARAMETER ESTIMATION, CFO COMPENSATION AND CHANNEL DECODING 


12 


( f3l] > into M sub-problems 

a(k,z) 

Ou,m = arg max < 




(^’ z ^ F h D (m Xu , m ) 


Fh 


(k,z— 1) 


(32) 

for m = 1,2, one for each block. Directly solving 

(32 1 gives the phase estimate for the m th block: 

= ^[r ( 4 M) ) f h d KJ Fh^’- 1 )] 
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With the new phase estimates j, we go to 

stage. 


(33) 
the third 


The Third Stage — Channel Estimation 

The third state updates the channel gains with the CFO 
and the phase drifts fixed to their last estimates. The new 


set of parameter estimates after the third stage is O 
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The objective function in ( [34] ) is obtained from the Q function 


in (281 with e u = e. 
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solution for (34 1 is given by the least square (LS) 


estimate 
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This finishes the final stage of the M-step. We then iterate 
back to the E-step with the set of new parameter estimates 

4M) (oi k ’ z )\ AI C( fc >- 

i \ Uu.m c 5 ll u 
l ) 771=1 
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^ “77, - 


Initialization and Termination of ECM Iteration: 

We bootstrap the ECM iteration with initial estimates 

oE 0) = |el fc ’ 0) , ^, hi t ' 0) j-, where the initial 

CFO estimate and channel estimates e^’°\ hi fe,0) are obtained 

'^ik 0) 

from the preamble of user u; the initial phase estimate Q u 'm 
is obtained from the pilot subcarriers of the m th block for 
every m. 

We repeat the E-step and M-step of ECM iteratively. When 
the number of iterations z reaches the preset maximum limit Z, 
we terminate the ECM algorithm for user u and take the final 
estimates as the approximate solution for the joint estimation 
problem ( 8 ): 
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are the final parameter 


estimates obtained from the M-step of the Z th ECM iteration, 
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X« ’ are obtained by making hard decisions based on the 


symbol-wise APPs 
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A oper¬ 


ating flow chart for the proposed SAGE-ECM algorithm is 
shown in Fig. [2] 


I). Alternative Iterative Receiver 


We have described our proposed SAGE-ECM receiver 
in Section III.C. We can obtain an alternative receiver by 
adopting another message passing algorithm — the min-sum 
algorithm — for channel decoding. 

Let us revisit ([ 8 ]) and consider how to solve it iteratively. 
We can apply SAGE again in this single-user subproblem. 
Specifically, we can break up one iteration into several stages 
where one stage updates just one parameter while fixing all 
other parameters to their last estimates. This iterative receiver 
corresponds to a pure SAGE framework, where all variables 
are treated as parameters and updated in a sequential manner. 

Using the above idea, the z th iteration for solving ([ 8 ]) 
consists of four stages. In the first, second and third stages, the 


CFOs, the phases and the channel gains are updated to e 
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, we complete the z 


th 


The fourth stage in ( [36] ) corresponds to the channel decoding 
operation. In contrast to the channel decoding in the SAGE- 
ECM framework in Section III.C, where the sum-product al¬ 
gorithm was used to obtain the APPs of the transmit symbols, 
the channel decoding method corresponding to ( [36] ) is the min- 
sum algorithm G9- (nj. For example, if convolution codes 
are employed, the SAGE-ECM framework (in Section III.C) 
would use the BCJR algorithm, but the pure SAGE framework 
here would use the Viterbi algorithm (a special case of the 
min-sum algorithm m- mi)- 

Henceforth, we will refer to the iterative receiver in Section 
III.C as the SAGE-ECM Sum-Product Rx, and the iterative 
receiver here as the SAGE Min-Sum Rx. We compare their 
performances through simulation study in the next section. 

The key difference between the SAGE Min-Sum Rx in 
this section and the SAGE-ECM Sum-Product Rx in Section 
III.C is how channel decoding assists the channel-parameter 
estimation. For the SAGE Min-Sum Rx, the symbol estimates 
X„ ’ ’ given by min-sum channel decoding are hard decisions 
— we lose soft information on X u that specifies the levels 
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Fig. 2. The operating flow chart for the proposed SAGE-ECM algorithm. 

of confidence on our estimates. As will be seen in the 
next section, these hard decisions cause significant perfor¬ 
mance degradation when accepted as parameter estimates. The 
SAGE-ECM Sum-Product Rx assists the channel-parameter 
estimation using the soft information on the transmit symbols 
in a more sophisticated manner. Finally, we remark that the 
SAGE min-sum RX here is equivalent to the receiver proposed 
in 0 (see a detail discussion about 0 in Appendix C). 

E. Complexity Comparison 

We analyze and compare the complexities of our SAGE- 
ECM receiver and other receivers. For these receivers, which 
are OFDM based, the most computation intensive parts are 
the DFT/IDFT operations, expressed by the multiplications 
with matrix F or F H in the equations of this paper. For 
implementation, the DFT/IDFT operations can be realized 
using FFT/IFFT. The complexity of FFT/IFFT is O (N log N). 
The other operations involved in the receivers are scalar 
additions, multiplications, divisions whose complexities are 
linear with the frame size. Therefore, for comparison purposes, 
we count the numbers of FFT and IFFT operations in our 
SAGE-ECM receiver and other receivers. 

We first look at a traditional non-iterative estimation receiver 
where the channel gains and CFOs are estimated from the 
preamble, phases are estimated from the pilot subcarriers, and 


then an iterative interference cancellation approach (such as 
the one proposed in |T4] |) is used for decoding. There are 2U 
preamble symbols and M' = M — 2U data symbols in each 
frame. To process the preambles of all users, it is required to 
perform 2U FTT operations. Then, in each iteration of iterative 
interference cancellation, the decoding of one user needs to 
perform M' FFT for transforming the interference-cancelled 
time-domian signals to the frequency domain for decoding, 
and perform M' IFFT for transforming the decoded signal 
back to the time domain to be used to cancel interference 
by the next user. The number of iterations for interference 
cancellation is K. Totally, there are 2U + 2KUM' FTT/IFFT 
operations for this traditional non-iterative estimation receiver. 

We next look at our SAGE-ECM receiver. For the ECM 
iteration of one user, we need to perform channel decoding 
in the frequency domain; then convert the M' mean vec¬ 
tors of frequency-domain data symbol vectors to the time 
domain for parameter estimations; after that, we convert the 
M' compensated time-domain signals back to the frequency 
domain for channel decoding in the next round of ECM 
iteration. Therefore, within one ECM iteration of one user, 
we need to perform 2M' FFT/IFFT operations. Since we have 
U users, Z ECM iterations and I\ SAGE iterations, in total 
we need 2 KZUM' FTT/IFFT operations for all the SAGE- 
ECM iterations. To initialize the SAGE-ECM iteration, we also 
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need to process the preambles of users. This also needs 2(7 
FTT operations. Totally, there are 2U + 2KZUM 1 FTT/IFFT 
operations for the SAGE-ECM receiver. From these results, we 
can conclude that the computation complexity of our SAGE- 
ECM receiver is approximately Z times larger than that of the 
traditional non-iterative estimation receiver. 

The pure SAGE receiver discussed in Section III.D needs 
the same number of FFT/IFFT operations as our SAGE- 
ECM receiver. Compared with our SAGE-ECM receiver, the 
complexity of the pure SAGE receiver is just slightly less 
computation intensive thanks to its simpler min-sum channel 
decoding: both receivers, however, have the same order of 
complexity in the frame size. 

IV. Simulation and Experimental Results 

This section presents simulation results and experimental 
results on software defined radio. 

A. Simulation Results 

In all simulations, the frame format is a slightly modified 
version of the 802.11a frame format {30). The DFT size is 
N = 64. The CP is of length N cp = 16. Among the N = 64 
subcarriers, there are N d = 48 data subcarriers, N p = 2U (U 
is the number of users) pilot subcarriers and 64 — N,i — N p 
unused guard band subcarriers. Each user transmits known 
symbols on 2 of the 2 U pilot subcarriers, and nulls the signal 
on the other 2U—2 pilot subcarriers. The modulation is BPSK. 
A way to realize a low-rate channel code scheme for IDMA 
is to serially concatenate a forward error correction (FEC) 
code with a repetition code |2'|. In our simulation, the FEC is 
a regular Repeat Accumulate (RA) code HD with code rate 
R\ = 1/3, and the code rate of the repetition code is R > = 
1/(7. Therefore, the overall code rate is R = R \ R 2 = 1 /3C7. 
The interleavers of all three users are generated randomly. The 
payload of each frame has 2400 bits. The two preamble blocks 
for each user are two successive copies of the long training 
sequence (LTS) defined in the 802.11a standard. However, as 
described in Section II.A, the two preambles of different users 
occupy different block times. 

We set the length of the discrete channel vectors for all 
three users to L\ = L 2 = L 3 = 4. The channel taps are 
generated according to the channel mode given in Section 
II.B. We assume that the receiver of the base station can 
capture the first channel path of user 1, so that the timing 
mismatch between user 1 and the base station is set to p\ = 0 . 
Furthermore, other p, u are randomly chosen from the interval 
[0,9]. Thus, the loose time synchronization requirement is 
satisfied in simulations. We set the CFO of each user to p 
or —p with equal probability, where p is the so-called CFO 
attenuation factor G3- and it is a deterministic parameter 
ranging in [0,0.5]. In our simulations, we vary the value of 
p to investigate its impact on system performance. SNR is 
defined as E{,/Nq, where h), is the energy per information bit 
and No is the noise variance. All simulation results presented 
here are obtained by averaging over 3 x 10 3 frames. 

For performance comparison, we investigate the following 
four approaches for OFDM-IDMA systems: (i) the iterative 


interference cancellation approach in fT4) with perfect knowl¬ 
edge of channel parameters (Full CSI Rx); (ii) the iterative 
interference cancellation approach in |[14) with the one-shot 
channel parameter estimations obtained from the preambles 
and pilots (One-shot Est. Rx); (iii) the SAGE Min-Sum Rx 
approach described in Section III.D as a benchmark; and 
(iv) the SAGE-ECM Sum-Product Rx approach proposed 
in Section III.C. For all receivers, we performed iterations 
until the algorithm converged (with SAGE iteration number 
K = 10, and ECM iteration number Z = 20). We remark 
that the performance of One-shot Est. Rx is the initial point 
of SAGE-ECM Sum-Product Rx. 

Fig. 0 presents the results of bit error rate (BER) versus 
SNR. The CFO attenuation factor is fixed to p = 0.2. The 
systmes with users U = 2, 3,4 are simulated. From the results, 
we first note that the BER gap between Full CSI Rx and One- 
shot Est. Rx is large. Specifically, the estimation errors of 
one-shot estimation induce around 13 dB SNR loss at BER 
= 10- 5 . Intuitively, since each imperfect parameter estimate 
induces an SNR penalty and we have many channel parameters 
in the system, the SNR penalties accumulate to an overall large 
penalty. 

The joint channel-parameter estimation, CFO compensation 
and channel decoding approachs reduce the performance loss, 
as shown in Fig. [3] Compared to one-shot Est. Rx, SAGE- 
ECM Sum-Product Rx (proposed) yields 8 dB SNR improve¬ 
ment and SAGE Min-Sum Rx (benchmark) yields 5dB SNR 
improvement at BER = 10 -5 . The 3 dB performance gap 
between SAGE-ECM Sum-Product Rx and SAGE Min-Sum 
Rx justifies that the belief about data symbol given by the sum- 
product channel decoding can assist the channel-parameter 
estimation procedures in a better manner. 

We also evaluate the mean square error (MSE) of the 
estimated channel parameters. Fig. [4] Fig.[5]and Fig. [ 6 ]present 
the MSEs of the estimated CFOs, channels and phases versus 
SNR. The CFO attenuation factor is fixed to p = 0.2. The 
units of the CFOs and the phases are Hz and radian. The 
systmes with users U = 2,3,4 are simulated. From the 
MSE results Fig. [4]|6] we clearly see that the SAGE-ECM 
approaches do have more accurate estimates than the tradi¬ 
tional preamble/pilot-based one-shot estimations. Moreover, 
the estimations of SAGE-ECM Sum-Product Rx are better 
than that of SAGE Min-Sum Rx; the difference in MSEs 
between them decreases as the SNR increases. The reason is 
that the results of min-sum decoding are hard decisions. In the 
low SNR regime, the hard decisions are not reliable enough 
and will propagate the decoding errors to channel parameter 
estimations. In the high SNR regime, the decoding results of 
min-sum decoding approaches that of sum-product decoding, 
thus the error gaps in MSEs become narrow. 

Fig. 0 shows the impact of CFO attenuation factor p on 
BER, where SNR is fixed to 16 dB and the system includes 
(7 = 3 users. The results again confirm the better performance 
of SAGE-ECM Sum-Product Rx over One-shot Est. Rx and 
SAGE Min-Sum Rx. It also shows that the BERs of all ap¬ 
proaches are insensitive to p: note that the different approaches 
have different performances; just that the performance of each 
approach is not sensitive to p. This result implies that it is not 
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Fig. 3. BER versus SNRs with p = 0.2. 


Fig. 4. CFO MSEs versus SNRs with p = 0.2. 
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Fig. 5. Channel gain MSEs versus SNRs with p = 0.2. 


Fig. 6. Phase MSEs versus SNRs with p = 0.2. 


the magnitude of that affects performance; it is the estimation 
errors of p, which vary among the different approaches. 

Fig. [8) Fig. [9] and Fig. [TO] show the MSEs of the estimated 
CFOs, channels and phases versus p. The SNR is fixed to 16 
dB. The system includes 3 users. The performance trends are 
the same with the BER in terms of p, as shown in Fig. [7] 
SAGE-ECM Sum-Product Rx has better MSE performances 
than SAGE Min-Sum Rx and One-shot Est. Rx do. For all 
approaches, their estimation errors depends only weakly on p 
for a wide range of p\ performances only degrade as p is near 
0.5 (the worst case). We thus conclude that it is the error in 
the estimation of CFO that has more effects than the actual 
CFO value itself. 


B. Experimental Results 

Going beyond simulations, we also verify our proposed 
approach experimentally. We implemented an OFDM-IDMA 
system using a software defined radio (SDR) platform. We 
collected the data for the received signal from the experimental 


system and evaluate the performance of the proposed approach 
using the collected data. 


The experimental system is built on the USRP N210 hard¬ 
ware |32l and the GNU Radio software with the UF1D hard¬ 


ware driver [331. We emulated an OFDM-IDMA system that 
includes one base station and two users (U = 2), by deploying 
three sets of USRP N210 with XCVR2450 boards (32) in 
our lab^The base station used 802.11 channel 1 (2.412GHz) 
to poll the two users to transmit together at channel 11 
(2.462GHz), thereby achieving the loose-time synchronization 
mentioned in Section II.C. The wireless channel bandwidth of 
our network is 4MHz. The use of 4 MHz bandwidth rather 
than a wider bandwidth is due to the limitation of the USRP 
hardware. After the terminal users received a beacon from the 


5 We performed the experiment for OFDM-IDMA using the SDR prototype 
of the physical-layer network coding (PNC) systems reported in detail in 
id m In the uplink of PNC, two users transmit signals to the relay 
simultaneously. This is similar to multiple-access systems. Therefore, we can 
borrow it for our use here. 
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Fig. 7. BER versus p with SNR=16 dB, (7 = 3. 



Fig. 9. Channel gain MSEs versus p with SNR=16 dB, (7 = 3. 



Fig. 11. Experimental BER results with (7 = 2. 



Fig. 8. CFO MSEs versus p with SNR=16 dB, (7 = 3. 



Fig. 10. Phase MSEs versus p with SNR=16 dB, (7 = 3. 
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base station, they then transmit their signals simultaneously. 
The base station received the simultaneous transmissions and 
converted them to digital data for processing. We performed 
controlled experiments for different received SNRs. The re¬ 
ceive powers of frames from two users at the base station were 
adjusted to be balanced (power imbalance within ldB). The 
base station transmits 100 beacons to trigger 100 simultaneous 
uplink transmissions for each fixed SNR. 

The frame format used is similar to the one used in 
simulations except for the following two differences. First, in 
the experimental setup, there are four pilot subcarriers within 
each OFDM block (as opposed to six in simulations) and each 
user transmits pilot symbols over two of them. Second, the 
channel coding scheme used is only the 1/3 coding rate RA 
code without the repetition code. Each frame includes 256 
OFDM blocks. 

The experimental BER results are shown in Fig. ITT] The 
frame error rate (FER) results are shown in Fig. |12| We 
compare the performances of SAGE-ECM Sum-Product Rx 
with One-shot Est. Rx. Generally, the performance trends are 
similar to those in our simulation results. Specifically, we 
observe that SAGE Sum-Product Rx achieves around 5 dB 
SNR gain over One-shot Est. Rx. at the BER of 10 -5 in 
the experiment. We note that the shapes of the experimental 
BER curves are different from those of the simulations in 
Fig .0 The reason is that the channel in our experimental 
environment is rather flat over the 4 MHz bandwidth. To 
verify our experimental results, we perform an additional 
simulation where the system includes U = 2 users, the frame 
format is the same as that in our experiment, the channel has 
one Rayleigh path (thus it is flat), and the CFOs are set to 
[£i,S 2 ,] T = [0.06,0.11] T that are the means of the measured 
CFO values in our experiment. The simulated BER and FER 
results under this setup are presented in Fig. 11 and Fig. 


12 respectively. As can be seen, the simulated results are 


consistent with the experimental results in that there is also 
around 5 dB SNR gain by SAGE-ECM Sum-Product Rx over 
One-shot Est. Rx. 


V. Conclusion 

We have put forth a framework that employs the SAGE 
and ECM algorithms to solve the problem of joint channel- 
parameter estimation, CFO compensation and channel decod¬ 
ing in multiuser OFDM-IDMA systems. Our framework is 
motivated by the fact that for multiuser OFDM systems, (i) 
one-shot non-iterative parameter estimation does not yield 
satisfactory accuracy; and (ii) one-shot non-iterative CFO 
compensation is impossible. For these reasons, we propose 
to solve the overall problem of channel-parameter estimation, 
CFO compensation, and channel decoding jointly and itera¬ 
tively. 

Within our framework, we compared different schemes of 
assigning the role of hidden data in ECM, and concluded that 
treating the data symbols (as opposed to channel parameters) 
as the hidden data in ECM leads to better performance. For 
the consistency and completeness of this scheme, we bridge 
the time-domain channel-parameter estimation procedure and 


the frequency-domain channel decoding procedure using “soft 
IDFT”. 

Our simulation results and real SDR experimental results 
indicate that compared with the traditional multiuser approach, 
the proposed approach can obtain 5 dB SNR gain for the two- 
user case, and 8 dB SNR gain for the three-user case. 


Appendix 


A. Derivation o/m 

We first expand the sample-wise Q function in ( |76| ) as 
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where we have already dropped the constant term. Then, we 
can easily compute the following expectations 
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where the mean and variance of y u ,m,i are given in the 
Gaussian expression for p 
Substituting (38 1 in to (37], we have 
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Since cr^ . is not relevant to D u . We just drop it. This gives 
the ultimate form of Q function in (|24|. 
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B. Derivation 

We denote the objective function in 


by 


/(e«) = 
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Expanding and dropping the terms irrelevant to e u , we obtain 
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We choose to approximate T (e u ) in the objective function 
using its Taylor series expansion 
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where we truncate the third-order terms and above and ei fe,z-1 
is the starting point. Substituting ( |42] > into © and differen¬ 
tiating the resulting / (e u ) with respect to s u yield 
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Finally, setting df ( £ u )/d£ u = 0 and solving the equation, we 
obtain the new CFO update shown in ( |30| . 

C. Discussion of Receiver in © 

Ref. fl7) proposed an iterative receiver for solving the prob¬ 
lem of joint channel-parameter estimation, CFO compensation 
and data detection in uncoded OFDMA systems. The authors 
of fl7) attempted to construct the receiver based on a SAGE- 
ECM framework. As explained later in this appendix, this 
attempt ended up with a pure SAGE framework due to an 
approximation. 

A major difference between the framework in © and ours 
is the assignment of hidden data in the single-user subproblem 
within the SAGE-ECM framework. For us, the data symbols 
X„ are the hidden data and the channel gains h„ are one of the 
parameters. For in © the channel gains h„ are the hidden 
data, and the data symbols X„ are one of the parameters. 

Another difference is that, unlike our work here, © 
did not incorporate channel coding/decoding — © only 
focused on detection of individual uncoded OFDM blocks 
X u m , and the correlations among different data blocks of the 
overall frame X,, induced by channel coding are not exploited 
to further improve performance. In particular, each OFDM 
symbols are decoded independently. Here, for consistency 
with our problem, we extend the treatment of © to coded 
OFDM-1DMA systems (i.e., we incorporate channel coding 
into the framework of 1171)- With this extension, we have 
n' u = | e u , {0 u ,m}m=i ■ x «} as the parameters and h u as the 
hidden data. 


The corresponding ECM aims to solve the following prob¬ 
lem iteratively: 
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Now, ECM solves (43 1 iteratively. In the z th iteration of 
ECM, the E-step of ECM computes the Q function of 
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and the M-step of ECM updates the tentative estimates 

■^~~(k,z— 1) 

f V n , for Q u by finding the tt u that maximize (431. 


The exact computation of the Q function in ( [44] ) is complex 
and not feasible from the implementation viewpoint (cf .eq 
(13) in fTT] for the Q function of the uncoded case). To reduce 
complexity, © made an approximation on the computed Q 
function. Extending the result in [ 171 to channel-coded case 
(cf. eq (16) in ©), the E-step of ECM approximates the Q 
function as 
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is the LS estimate of h. 


(45) 

obtained 


in the signal model (JE]). With this Q 


function, the CFO and the phases can be updated to £■ 


k,z ) 


, M 


and using the same methods of (30 1 and (33 I 

in Section Illjfc, respectively. However, the transmit symbols 
X„ here is updated in a different way than in Section III.C. 
Specifically, the stage in the M-step for updating the transmit 
symbols now becomes ( [36] ), which can be solved using the 
min-sum channel decoding, as explained in Section III.D. 

A subtle but important consequence of approximating the 
Q function as in ( [45] ) is that, although the development of 
the receiver in ]T7jbegins with the ECM algorithm, it ends 
up with a pure SAGE solution to (| 8 ]. In other words, it 
is not a SAGE-ECM algorithm anymore, but a pure SAGE 
algorithm. The algorithm essentially treats all the variables in 
{0u,m}^f =1 , h u , X u | as parameters, and none of them 
as hidden data. As in the SAGE algorithm, the parameters 
are estimated sequentially one at a time; when one parameter 
is under estimation, all other parameters are fixed to their 
estimates from the last iteration. This pure SAGE algorithm 
is exactly the same as that we discussed in Section III.D and 
adopted as one of the benchmarks in our simulation studies in 
Section IV. 
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